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ABSTRACT 



An architecture for capture and generation, and a set of 
methods for characterization, prediction, and classification 
of traffic in packet networks are disclosed. The architecture 
consists of a device that stores packet timing information 
and processes the data so that characterization, prediction, 
and classification algorithms can perform operations in 
real-time. A methodology is disclosed for real-time traffic 
analysis, characterization, prediction, and classification in 
packet networks. The methodology is based on the simul- 
taneous aggregation of packet arrival times at different times 
scales. The traffic is represented at the synchronous carrier 
level by the arrival or non-arrival of a packet. The invention 
does not require knowledge about the information source, 
nor needs to decode the information contents of the packets. 
Only the arrival timing information is required. The inven- 
tion provides a characterization of the traffic on packet 
networks suitable for a real-time implementation. The meth- 
odology can be. applied in real-time traffic classification by 
training a neural network from calculated second order 
statistics of the traffic of several known sources. Perfor- 
mance descriptors for the network can also be obtained by 
calculating the deviation of the traffic distribution from 
calculated models. Traffic prediction can also be done by 
training a neural network from a vector of the results of a 
given processing against a vector of results of the subse- 
quent processing unit; noticing that the latter vector contains 
information at a larger time scale than the previous. The 
invention also provides a method of estimating an effective 
bandwidth measure in real time which can be used for 
connection admission control and dynamic routing in packet 
networks. The invention provides appropriate traffic descrip- 
tors that can be applied in more efficient traffic control on 
packet networks. 

21 Claims, 11 Drawing Sheets 
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METHOD FOR REAL-TIME TRAFFIC 
ANALYSIS ON PACKET NETWORKS 

This application claims priority under 35 U.S.C.119 from 
Provisional Application Sen No. 60/034,607 filed Jan. 3, 5 
1997. 

THE FIELD OF THE INVENTION 

This invention relates in general to traffic analysis on Q 
packet networks. Packet networks are telecommunication 
networks in which the information is transmitted in small 
binary groups called packets. An advantage of packet net- 
works is that it can handle different sources simultaneously 
by processing the packets sequentially. The packets travel J5 
through the network via a fast synchronous carrier; this can 
be viewed as a train of pulses which transport the packets. 
The speed of this carrier determines the maximum speed of 
the packet network. These networks can process only one 
packet at a time and thus the order in which the packets are 2Q 
processed depends on the priorities and quality of service 
required by the sources. Packets from a given source are 
mixed with packets from other sources. Each packet has a 
header that identifies its destination. Once the packets arrive 
at their destination, the headers are removed and the infor- 25 
mation is reassembled. Since these networks generally carry 
traffic from different types of sources which demand differ- 
ent service levels from the network, it is important from a 
network operation and management point of view to know 
the characteristics of the expected traffic. In the standards for 3Q 
packet networks that use the asynchronous transfer mode 
protocol (ATM), the traffic is described by first order statis- 
tics such as the peak cell rate, cell delay variation tolerance, 
sustainable cell rate, and maximum burst size. The size of 
the packets in ATM networks (that is networks which utilize 35 
the ATM protocol) is 53 bytes, of which 5 bytes compose the 
header and the remaining 48 bytes contain a section of the 
information being transmitted. The rate is defined as the 
number of packets that flow through the network in a given 
unit of time; this is a measure of the speed of the telecom- 4Q 
munications network. The peak cell rate is defined as the 
inverse of the minimum time between successive packet 
arrivals to a switch. The cell delay variation tolerance is 
defined as the sensitivity of the information to changes in 
delay of the packets as they flow through the network. The 45 
sustainable cell rate is defined as the maximum average rate. 
The maximum burst size is defined as the maximum number 
of cells at the peak rate. Two of these parameters, the peak 
cell rate and the sustainable cell rate, have been defined as 
mandatory traffic parameters (or descriptors) in the ATM 5Q 
Forum UNI version 3.0 standards, as explained by McDysan 
and Spohn [11]. 

There are two types of networks which differ according to 
the way a connection is handled. The first type is connection- 
oriented. In these networks it is required to set up several 55 
parameters before any data transmission can take place. This 
is a process in which a source negotiates a level of service 
with the network. An end-to-end path with a quality of 
service is established and all the packets from the source will 
follow this path. The second type is connectionless in which 60 
it is not required to set up an end-to-end connection. The 
network bandies the packets individually. 

Quality of service (QoS) is a parameter meaningful from 
a source to a destination point of view, as well as at each link 
in the network. In connection -oriented networks, the nego- 65 
tiation for a QoS is carried out by agreeing on certain 
parameters. These parameters are based on first order sta- 
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tistical measures of the performance, such as the average 
delay, cell delay variation, error rates, and different levels of 
packet loss, as explained in the Bellcore requirements for 
broadband switching systems [2], 

Packet networks have been designed to carry traffic from 
multimedia sources, among them different types of video 
and audio, voice, and data communications. Each traffic 
source presents the network with different requirements. The 
network must be able to handle all these traffic sources at 
their respective quality of service. The problem that arises is 
how to accurately characterize the different traffic sources 
for efficient network utilization. It might, in fact, be required 
to measure quality in different ways for different traffic 
sources. This makes the performance measurement problem 
very complex. 

In our invention, a method is used to characterize the 
traffic in real-time. The method is used to calculate traffic 
descriptors considering properties of the traffic which have 
not been considered previously in commercial equipment. 
The descriptors are based on properties of the traffic that 
have been reported in the literature, but the techniques 
available are not suitable for real-time measurements. The 
algorithm presented is based on the simultaneous measure- 
ment of the traffic at different time scales. The data is 
represented in an appropriate form, processed and organized 
in an array of vectors. From this array, higher order statis- 
tical measures are derived. The traffic descriptors calculated 
in this way are used to characterize the traffic. The algorithm 
is implemented in real-time. The information is also used for 
traffic classification and performance prediction. 

THE DESCRIPTION OF RELATED ART 

The following references have been identified in a search 
in this field, some of which are relevant to the present 
invention: 

Publications 

[1] R. Addie, M. Zukerman, and T. Neame, "Fractal Traffic: 
Measurements, Modelling and Performance Evaluation", 
in Proc. IEEE Infocom, pp. 977-984, 1995. 
[2] Bellcore, "Broadband Switching System Generic 
Requirements", GR-1110-CORE, Revision 3, April 1996. 
[3] J. Beran, R. Sherman, M. Taqqu, and W. Willinger, 
"Long-Range Dependence In Variable-Bit-Rate Video 
Traffic", in IEEE Trans, on Communications, vol. 43, no. 
4, pp. 1566-1579, April 1995. 
[4] Y. Chen, Z. Deng, and C. Williamson, "A Model for 
Self-Similar Ethernet LAN Traffic: Design, 
Implementation, and Performance Implications", internal 
report, University of Saskatchewan, Canada, 1995. 
[5] M. Devetsikiotis, 1. Lambadaris, R. Kaye, "Traffic Mod- 
eling and Design Methodologies for Broadband 
Networks", Canadian Journal on Electrical and Com- 
puter Engineering", vol. 20, no. 3, 1995. 
[6] M. Garrett and W. Willinger, "Analysis, Modeling and 
Generation of Self-Similar VBR Video Traffic", in Proc. 
ACM Sigcom, London, UK, pp. 269-280, 1994. 
[7] R. Guerin, H. Ahmadi and M. Naghshinch, "Equivalent 
Capacity and its to Bandwidth Allocation in High Speed 
Networks", IEEE JSAC, vol. 9, no. 7, 1991. 
[8] C. Huang, M. Devetsikiotis, I. Lambadaris, and A. Kaye, 
"Modeling and Simulation of Self-Similar Variable Bit 
Rate Compressed Video: A Unified Approach", in ACM 
Sigcom, Cambridge 1995. 
[9] W. Lau, A. Errarailli, J. Wang, and W. Willinger, "Self- 
Similar Traffic Generation: The Random Midpoint Dis- 
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placement Algorithm and its Properties", in Proc. IEEE 
Int Cbnf. Commun., 1995. 

[10] W. Leland, M. Taqqu, W. Willinger, and D. Wilson, "On 
The Self-Similar Nature of Ethernet Traffic (extended 
version)", IEEE/ACM Trans, Networking, vol. 2, no. 1, 
pp. 1-15, February 1994, 

[1] B. Mandelbrot, "Self-Similar Error Clusters in Commu- 
nication Systems and the Concept of Conditional 
Stationarity", in IEEE Trans, on Communication 
Technology, pp. 71-90, 1965. 

[12] D. McDysan and D. Spohn, "ATM Theory and Appli- 
cation". Toronto: McGraw-Hill, 1995. 

[13] D. McLaren and D. Nguyen, "A Fractal-Based Source 
Model for ATM Packet Video", in Int. Conf. on Digital 
Processing of Signals in Communications, Univ. of 
Loughbovough, September 1991. 

[14] V. Paxson, "Fast Approximation of Self -Similar Net- 
work Traffic", report LBL-36750, Univ. of California at 
Berkeley, Lawrence Berkeley Laboratory, 1995. 

[15] A. Rueda and W. Kinsner, "A Survey of Traffic Char- 
acterization Techniques in Telecommunication 
Networks", Proc. IEEE Canadian Conference on Elec- 
trical and Computer Engineering, pp. 830-833, May 
1996. 

U.S. Patents 

U.S. Pat. No. 5,050,161 Congestion management based 

on multiple framing strategy 
U.S. Pat. No. 5,274,625 Traffic measurements in packet 

communication networks 
U.S. Pat. No. 5,341,366 Connection admission control 

system 

U.S. Pat. No. 5,343,463 Performance measurement sys- 
tem for a telecommunication path and device used 
therein 

U.S. Pat. No. 5,343,465 Method and system for real-time 
bursliness analysis of network traffic 

U.S. Pat. No. 5,357,507 Fast connection admission con- 
trol for ATM networks 

U.S. Pat. No. 5,357,510 Apparatus and method for super- 
vising and controlling ATM traffic 

U.S. Pat. No. 5,365,514 Event driven interface for a 
system for monitoring and controlling a data commu- 
nications network 

U.S. Pat. No. 5,375,070 Information collection architec- 
ture and method for a data communications network 

U.S. Pat. No. 5,394,394 Message header classifier 

U.S. Pat. No. 5,426,635 Method for adaptive control of 
windows and rates in networks 

U.S. Pat. No. 5,434,848 Traffic management in packet 
communication networks 

U.S. Pat. No. 5,448,567 Control architecture for ATM 
networks 

The traffic characterization techniques for telecommuni- 
cation networks found in the literature can be classified into 
the following categories [5], [14]: autoregressive moving 
average (ARMA) models, Bernoulli process modeling, 
Markov chain modeling, neural network models, self- 
similar models, transform-expand-sample (TES) models, 
traffic flow models, and wavelet models. 

Different kinds of stochastic (statistical) models reported 
in the literature have successfully been used in modeling 
traffic in telecommunication networks. For example, 
Markov chains are a useful tool in modeling communication 
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systems. It is widely accepted that the short-term arrival 
processes in telecommunication networks can be accurately 
described by Poisson processes, for example an FTP control 
connection which can be modeled as a Markov modulated 

5 Poisson process (MMPP) [13]. 

Traffic on packet networks is irregular in nature as 
explained by Leland, et ah [9]. It is generally accepted that 
the long-range dependencies found in multimedia traffic can 
be described using models which consider self-similarity. 

10 Self-similarity is a measure of the variation of the traffic 
properties at different time scales. Several traffic models of 
this type have been reported [1], [4], [6], [7], [8], and [12]. 

The traffic descriptors that have been utilized in packet 
networks are based on statistical measures such as the mean, 

is peak and sustained rates, burst length, and cell-loss ratios. 
These do not quantify correlation well, and thus a need 
exists for descriptors that provide more information in order 
to describe highly correlated and bursty (irregular) multi- 
media traffic [3] and [10]. 

20 Traditional analysis techniques cannot be successfully 
applied. The methods that have been proposed for the 
analysis of traffic in packet networks are impractical for a 
real-time implementation since they required calculations 
that involved several processing phases on a stored time 

25 series. 

A method for time deviation (TDEV) calculation for 
synchronization monitoring in SONET/SDH networks is 
proposed by Grove r and Stamatlakis [15]. Their method 
consists of the calculation of simultaneous averages of the 

30 second differences of time interval errors (HE) at different 
time scales. A time scale is represented by blocks that 
contain a number of second differences which is a power of 
two. For each block the second differences are added and the 
result is squared. The sums of the squares are then divided 

35 by six and by a constant to obtain a measure called the time 
variance (TVAR) for the time scale. Each new TIE value 
produces a new second differences and this is used to 
updated all the block sums of all the time scales. Each new 
second difference could complete a block sum for one or 

40 more blocks and a new TVAR could be obtained. The results 
are used to produce a plot of the logarithm base two of the 
TVAR values of all the blocks versus the logarithm base two 
of the block size. Their paper also details the standard block 
calculation which consists of calculating the TVAR values 

45 for all the blocks in a batch mode (or block mode, or 
off-line). This consists of calculating the block sums and 
TVAR values of a time series of TIE values for each block 
separately. 

[7] W. D. Graver and D. Stamatlakis, "Continuous TDEV 
50 calculation for in-situ synchronisation monitoring in 
SONET/SDH networks", Electronic Letters, vol. 29, No. 
16, pp. 1405-1406, August 1993. 

Namajunas and Tamasevicius [16] proposed a device for 
measuring fractal dimensions from a time series in real-time. 
55 Their device is an electronic circuit that produces an 
approximation to the fractal dimension of a class of analog 
electric signals in real-time. 

[8] A. Namajunas and A. Tamasevicius, "A technique for 
measuring fractal dimensions from time series on a real- 
60 time scale", Physica D, vol. 58, pp. 482-488, 1992. 

SUMMARY OF THE INVENTION 

It is one object of the present invention to provide a 
method for characterizing transmissions in a packet network 
65 which may or may not be carried out in real lime and may 
provide information which can be used to characterize the 
transmissions for prediction and access control. 
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According to a first aspect of the invention there is 
provided a method for deriving information related to char- 
acteristics of transmissions in a packet network comprising: 
providing a packet network for carrying a plurality of 
transmissions from at least one source in which the 5 
transmissions from the or each source are divided into 
a plurality of sequential packets each packet having 
address data defining an intended address, information 
data defining information to be transmitted and id data 
defining a source identity; 10 
the network defining a train of sequential packet transport 
locations into which packets are loaded for 
transmission, such that some packet transport locations 
in a train contain packets and some packet transport 
locations are empty and such that, when there is more 15 
than one different source, the train contains packets 
from the different sources in a sequential arrangement 
as determined by the network; 
monitoring a train of packet transport locations to deter- 2Q 
mine which packet transport locations are empty and 
which contain a packet; 
generating a series of data elements each corresponding to 
a respective one of the packet transport locations and 
each identifying whether the respective packet trans- 2 5 
port location is empty or whether the respective packet 
transport location contains a packet; 
and carrying out statistical analysis on the series of data 
elements to determine the characteristics of the trans- 
missions. 30 
Preferably the statistical analysis is carried out in real time 
so that predictions and characterization of the transmissions 
can be done in real time. 

Preferably, as an alternative, the data elements are stored 
for example in the hard drive of a PC for subsequent analysis 35 
and the statistical analysis is carried out subsequent to 
completion of the transmissions when a series of transmis- 
sions have been recorded for analysis. 

Preferably the statistical analysis is carried out at a 
plurality of different time scales in order to provide the 40 
calculations as set out hereinafter. Preferably the statistical 
analyses are carried out simultaneously by providing for 
each different time scale a respective one of a plurality of 
registers and entering information from the data elements 
into each register sequentially, 45 

Preferably, in a simple single mode, the data elements 
comprise data bits defining "0" when the respective packet 
transport location is empty and "1" when the respective 
packet transport location contains a packet and wherein the 
information for each register is obtained by adding the 50 
contents of a next adjacent previous register. 

Using the data bits, the analysis includes, for each register, 
calculating the sample variance of a set of successive 
observations of the register contents and from the variances 
estimating a value of the Hurst parameter H, which is the 55 
slope of a line which approximates the behavior of a plot of 
the logarithm of the variances of the registers versus the 
values of the sequential indices of the registers. This param- 
eter is known per se but the present algorithm provides a 
technique for calculating this parameter in real time. 60 

When the packets in the train are provided by a plurality 
of different sources the data elements are arranged to iden- 
tify a packet transport location as containing a packet only 
when the packet is identified from the id data as provided by 
a selected one of the sources such that the characteristics 65 
determined relate to only the transmissions from the selected 
source. 
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When the packets in the train are provided by a plurality 
of different sources the data elements are arranged to iden- 
tify a packet transport location as empty when no packet 
from any of the sources is contained and to identify when a 
packet transport location contains a packet from the id data 
as provided by each one of the sources which source 
provided the packet. 

The characteristics determined from the data elements can 
be used to provide a calculation of an effective bandwidth of 
the transmissions for use in access control and prediction. 

In particular the above method can be used in one 
example for determining whether a source additional to a 
plurality of existing sources of packet transmissions, each 
source having a predetermined peak rate of packet 
transmission, can be connected to a packet network, where 
a transmission medium of the network has a predetermined 
maximum allowable peak rate of transmissions. This is 
preferable' effected by the steps of: 

carrying out in real time the statistical analysis on the 
series of data elements to determine an effective band- 
width of the transmissions from the existing sources; 
and calculating whether the additional source can be 
connected by comparing the effective bandwidth, the 
predetermined peak rate of packet transmission of the 
additional source and the predetermined maximum 
allowable peak rate of transmissions of the transmis- 
sion medium of the network. 
As an alternative, the method can be used for calculating 
whether a source additional to a plurality of existing sources 
of packet transmissions, each source having a predetermined 
peak rate of packet transmission, can be connected to a 
second packet network, a transmission medium of the sec- 
ond packet network having a predetermined maximum 
allowable peak rate of transmissions. This is effected by the 
steps of: 

connecting the additional source to the packet network; 
generating said series of data elements from transmissions 

from said additional source to said packet network; 
carrying out in real time the statistical analysis on the 
series of data elements to determine an effective band- 
width of the transmissions from said additional source; 
and calculating whether the additional source can be 
connected to the second packet network by using the 
effective bandwidth of the transmissions from said 
additional source. 
According to a second aspect of the invention there is 
provided a method for deriving information related to the 
characteristics of transmissions in a packet network com- 
prising: 

providing a packet network for carrying a plurality of 
transmissions from at least one source in which the 
transmissions from the or each source are divided into 
a plurality of sequential packets each packet having 
address data defining an intended address, information 
data defining information to be transmitted and id data 
defining a source identity; 

the network defining a train of sequential packet transport 
locations into which packets are loaded for 
transmission, such that some packet transport locations 
in a train contain packets and some packet transport 
locations are empty and such that, when there is more 
than one different source, the train contains packets 
from the different sources in a sequential arrangement 
as determined by the network; 

monitoring a train of packet transport locations to deter- 
mine which packet transport locations are empty and 
which contain a packet; 
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generating information defining which packet transport source, nor needs to decode the information contents of the 
locations contain a packet and the empty locations packets. Only the arrival timing information is required. The 
therebetween; binary representation of the traffic, that is, the arrival or 
and carrying out simultaneously and in real time a series non-arrival indicator, is processed by a series of processing 
of statistical analyses on the information at a plurality 5 units and organized in an array of vectors. The processing 
of different time scales to determine the characteristics units are arranged in a series in such a way that the each 
of the transmissions. processing unit operates with the result from the previous 
In this aspect, the information is not necessarily the data unit. The first processing unit operates directly with the 
bits as set forth above but can include other information timin g information. The results of all the processing units 
relating to the population of the packets. However in this 1° are organized in the array of vectors and from these vectors 
aspect, the analysis is effected in real time. Preferably information about the traffic can be derived. Statistical 
however as explained in detail hereinafter, the information measures on the vectors such as variance, provide appro- 
comprises a series of data elements each corresponding to a pnate parameters at different time scales to calculate traffic 
respective one of the packet transport locations and each descriptors. These descriptors encapsulate different proper- 
identifying whether the respective packet transport location 15 lies °f lne traffic, such as burstiness and self-similarity, 
is empty or whether the respective packet transport location wnicn were previously not considered in traffic management 
contains a packet and wherein the statistical analyses are on packet networks. 

carried out by providing for each different time scale a A characterization is obtained of the traffic on packet 

respective one of a plurality of registers and entering infor- networks suitable for a real-time implementation. The meth- 

mation from the data elements into each register sequen- 20 odoiogy can be applied in real-time traffic classification by 

tially. training a neural network from calculated second order 

More preferably the data elements comprise bits which statistics of the traffic of several known sources. Perfor- 

define "0" when the respective packet transport location is mance descriptors for the network can also be obtained by 

empty and "1" when the respective packet transport location calculating the deviation of the traffic distribution from 

contains a packet and wherein the information for each 25 calculated models. Traffic prediction can also be done by 

register is obtained by adding the contents of a next adjacent training a neural network from a vector of the results of a 

previous register. given processing against a vector of results of the subse- 

According to a third aspect of the present invention the Quent processing unit; noticing that the later vector contains 

same technique for generating bits relating to the empty and information at a larger time scale than the previous. The data 

filled packets can be used in reverse in a method of gener- 30 produced by the algorithm can also be used for performance 

ating packet transmissions for simulating a source having prediction, 
required transmission characteristics for transmitting on a 
packet network comprising: 

providing a packet network for carrying a plurality of One embodiment of the invention will now be described 

transmissions from at least one source in which the 35 in conjunction with the accompanying drawings in which: 

transmissions from the or each source are divided into FIG. 1 illustrates the algorithm. The timing information 

a plurality of sequential packets each packet having from the synchronous line is represented by a '1' if a packet 

address data defining an intended address, information arrival has occurred, and a '0' otherwise. This information is 

data defining information to be transmitted and id data processed and organized in vectors. The different compo* 

denning a source identity; 4 nents of the algorithm have been labeled. 

the network defining trains of sequential packet transport pjQ, 2 shows an application of the algorithm in ATM 

locations into which packets are loaded for networks. This is a particular case of the algorithm. The 

transmission, such that some packet transport locations required values of the parameters are shown in the figure, 

in a train contain packets and some packet transport 45 The input information is SONET OC3/OC12, which is a 

locations are empty and such that, when there is more standard synchronous carrier for ATM. 

than one different source, the train contains packets FIG. 3 is a diagram showing the first method used to 

from the different sources in a sequential arrangement construct the H . mcter a i gor ithm input stream of l's and 0's 

as determined by the network; from ^ capture ceH timestamps . 

from a statistical analysis of previous actual transmissions 50 In FIG 4 mustrates the second method of re .p roducing 

on the network, generating for the simulated source the {h& AJM lfaffic stream ^ method eliminales lhe first four 

required characteristic; coluniQS of the afray 

generating frorn the required characteristic a series of data FIG. 5 describes the data representation process, 

elements each corresponding to a respective one of the „ fir> ., . , . 

packet transport locations and each specifying whether, 55 FIG * 6 deSCnbCS the dat3 S eneratl0n P rocess " 

in a simulated train of packets, a respective packet FIG - 7 represents the data representation process for two 

transport location is empty or whether the respective channels. 

packet transport location contains a packet; FIG. 8 shows a block diagram of the implementation of 

and creating from the data elements the simulated packet the algorithm in a parallel processing platform with four 

train. 60 P™*ssors. 

Thus as set forth above, a methodology is disclosed for FIG. 9 shows a block diagram of an optimized imple- 

real-time traffic analysis, characterization, prediction, and mentation of the algorithm in a parallel processing platform 

classification in packet networks. The methodology is based with three processors. 

on the simultaneous aggregation of packet arrival times at FIG. 10 depicts an application of the algorithm in char- 
different time scales. The traffic is represented at the syn- 65 acterization of moving images. The coefficient of variation 
chronous carrier level by the arrival or non-arrival of a of the vectors has been plotted for different quality factors of 
packet. The invention does not require knowledge about the the transmission. The plot shows the logarithm of the 
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coefficient of variation of the vectors against the logarithm 
of the 2 to the power of the vector number. Using this 
technique it can be determined how the statistical properties 
of the traffic vary as a function of the quality level of the 
video compression. 

FIG. 11 illustrates an application of the invention in traffic 
classification. The vertical axis is the log of the variance, and 
the horizontal axis is the log of the bin size index. The plots 
represent typical variance-time curves of transmissions of 
video, audio, and ftp (file transfer protocol) signals. A 
variance-time curve is a plot of the logarithm of the variance 
of the vectors against the logarithm of two to the vector 
number. The plot for each stream is clearly distinct from the 
others. Since the plots can be constructed in real-time as the 
vectors are being calculated, this technique can be applied in 
real-time to characterize the traffic from measures of the 
variance-time curves. 

FIG, 12 illustrates the process of generating the Hurst 
parameter H from variance time curves. The Hurst param- 
eter is calculated at different time intervals and a plot is 
constructed with the different values of H. The resolution of 
the graph depends on the lenth of the time intervals. 

FIG. 13 shows an application of the algorithm for calcu- 
lation of the H parameter of the 'jump to hyperspace' scene 
of the StarWars movie. At the centre of the plot (left-hand 
side image) it is seen that the H parameter increases in value 
as the images changes. This is a motion JPEG encoded 
transmission of the video over an ATM network. 

FIG. 14 illustrates the application of the algorithm in 
classification of different traffic streams. A scene with the 
character R2D2 from the motion JPEG-coded StarWars 
movie over ATM is compared agains an audio transmission 
of the same movie. 

FIG. 15 shows that application of the algorithm to the 
observation of scenes of high activity from the motion 
JPEG-coded StarWars movie. 

FIG. 16 shows the detrended data as presented to the 
neural network classifier. 

FIG, 1 7 depicts a diagram of the competitive learning- 
based neural network that solves the data classification 
problem. 

FIG. 18 shows the after-training weight vectors of the 
neural network classifier indicated by a *o\ and input 
vectors indicated by a '+'. 

FIG. 19 illustrates one method of calculating maximum 
queue length. 

FIG. 20 illustrates the method of calculating maximum 
queue length employed in the estimation of effective band- 
width. 

FIG. 21 illustrates the calculation of the long term effec- 
tive bandwidth projection. 

DETAILED DESCRIPTION OF A PREFERRED 
EMBODIMENT 

Referring to FIG. 1, the data is processed and organized 
by an ordered series of bins, and stored in an ordered array 
of vectors in such a way that information can be derived 
regarding traffic at different time scales. Each bin consists of 
a primary accumulator and a secondary accumulator. A 
series of bits representing the presence "1" or absence "0" of 
a cell or packet in successive transport locations of a traffic 
stream is fed into the primary accumulator of the first bin of 
the series. The bins are labelled with the integers {1,2, . . . , 
m} and the accumulators are labelled by pairs of integers of 
the form (i j) so that the accumulators in bin i are labelled 
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(i,l) and (i,2) for the primary and secondary accumulators 
(respectively) of bin i. Each accumulator has a number of 
memory locations and the number of memory locations in 
accumulator (i j) is represented by p y . 

When an accumulator receives an integer, it stores the 
integer in the next available (i.e. empty) memory location if 
one is available. If all of the memory locations are full, then 
the accumulator computes the sum of all the integers in its 
memory locations, outputs the result, clears (erases) all of its 
memory locations and writes the new integer into the first 
memory location. When a primary accumulator outputs an 
integer, it passes it to its corresponding secondary accumu- 
lator (i.e. the other accumulator in its bin) and to the primary 
accumulator of the next bin in the series. When a secondary 
accumulator outputs an integer, it passes it to the vector 
which corresponds to its bin (i.e. the vector labeled with the 
same index). 

The array of vectors s c is used to store the information 
processed by the bins. Each vector can store n elements, 
where n>2 is a constant. 

The bins are configured in two possible ways: a) as shift 
registers, or b) as recursive registers. The shift register 
receives a new element always in the first position and shifts 
down data already stored. The recursive register stores a new 
element in the position of the array that holds the oldest 
element by replacing it. In both cases, the oldest element is 
then sent to the vector s c for storage. Bin c is cleared every 
time it has received q c or p^ elements (depending on the 
case chosen) and a datum has been calculated. 

Similarly, the vectors s c are configured in two possible 
ways: a) as an array of shift registers, or b) as an array of 
recursive registers. In the first case, a new element coming 
from bin c is stored in the first position of s c and the other 
elements are shifted. In the second case, a new element is 
stored in the position of the oldest element. In both cases the 
oldest element is dropped. 

The algorithm is organized such that the integers passed 
to the vector s c of the array are the values of sums of n c 
successive bits of the binary stream fed into the primary 
accumulator of bin 1 where n c is given by 

These numbers thus represent the number of packets 
arriving in successive time intervals of duration n c each. We 
refer to n c as the bin count for bin c. 

The vectors s c contain timing information at different time 
scales. This information can be used for characterization, 
prediction, and classification of traffic. In the case of OCn 
traffic analysis, the traffic can be encoded into a stream of 
'l's and '0's representing an arrival or non-arrival of a 
payload cell belonging to a particular VPI/VCI (virtual path 
identifier/virtual circuit identifier). Timing information of 
multiple connections is denoted by multiple bits. The time 
between two consecutive bits (or group of bits) is the 
minimum cell spacing time, i.e. maximum rate of the 
SONET line. This method can also be used to generate traffic 
from pre-loaded algorithms. This can be very useful for 
stress-testing switches. 

In the special case of this algorithm where p lU =2 and p i:2 32 
1 for each bin i the bin counts are given by n c »2 2 . The 
algorithm starts with a stream of '1 's and *0's as input. A ' V 
represents an occupied ATM cell in a traffic stream. 
Similarly, a '0* represents an empty cell (no data). The data 
stream is presented to an accumulator. The first element is 
simply placed in the primary accumulator of bin 1. When the 
second element arrives, it is placed in the accumulator, 
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added to the previous element and the sura is stored in an of generating and capturing, in real-time, ATM traffic at 

N-element vector (array). The sum is also passed to the OC-3 rates. The card generates and captures traffic using a 

primary accumulator for the next bin (column) (which traffic profile which records the timing information of an 

similarly adds every two inputs, stores the sum, and passes ATM traffic stream, without any retention of the specific 
it to the next bin) (column). After an accumulator sum is 5 payload data within the cells. The card is capable of gener- 

stored, the accumulator is cleared (reset to 0) to await the ating and capturing traffic from 3 virtual circuits (VC). 

next input. Since the number of elements (rows) in each At the heart of the capture/generation card, is an FPGA. 

vector (column) of the array is limited, when the limit is The FPGA performs the ATM cell processing tasks, as well 

reached, an insertion pointer is reset to the beginning of the as serving as the controller of operation of the card. A 

array (so that the next value stored overwrites the previous microcontroller and an interface to a personal computer 

value). After a certain number of elements (or at the end of serial port arc incorporated on the card, 

processing) each column calculates a variance measure of its An ATM physical layer device supplies captured cells to 

array contents using the formula: the FPGA. This component also allows for ATM cell 

transmission, if enabled. RAM on the card stores the look-up 

einttr „ ettm 2 table for the headers the card is expected to capture. Logic 

crow * sums q — sum ~ c .... , . , r , _ Jf_ 

Var = crowticroyv-i)****™ on the card V*™ 1 * lt 10 be Pegged into the PCI slot of a PC, 

allowing a means of storing to disk the captured ATM traffic 
patterns. 

Where crow is the number of elements in the array, sum is The representation of the ATM traffic stream in the traffic 

the summation of the elements of the array, and sumsq is the profile file is based on encryption of the ATM cell header; 

sum of squares of the elements in the array, colnum is the 20 the cell payload is ignored. During traffic capture, the 

identification number of the column (the first column has payload is dropped and for traffic generation, the payload of 

colnum- 1, the second has colnum «2, etc.). the outgoing cells is hard-coded with a predetermined value. 

From the column variances, the Hurst-parameter is found The encoding scheme allows representation of the headers 

by the formula H=l-0/2, f rom number of active VCs (currently 3) as well as that of 

where 0 represents the slope of the variance-time curve on 25 an jdie/unassigned cell. By combining this encrypted traffic 

a log-log scale. Thus, the steeper the variance-time curve is, file wiln a lookup ta51e containin g the spe cific byte 

the smaller the corresponding Hurst -parameter value must values of the neadere fof each of the active va% one is able 

" - , , - . «- . . , to reproduce the timing of a traffic stream with specific 

A first method of constructing the traffic stream is based r . 

o n the comparison between the timestamp of the most recent _ p J; r * ~ 4 , . . r , , . , 

cell to the P timestamp of the previous cell. This would 30 *> r tra ^ Ration, the card is supplied with a desired 

indicate how much time has elapsed between cells. This lr ^ c profile, encoded as described above and a set of 3 

interval could then be divided by the minimum cell interval X™ 1 <* n headers * M generation proceeds, the traffic profile 

(for l%OC-3, 0.00028 seconds) to determine how many 15 decrypted and the specified headers loaded into the 

l%OC-3 cell periods had elapsed (these would be the outgoing SONET frame. As mentioned previously, the ATM 

number of 0's to insert into the array before inserting a 1. 35 cell payload is not considered, but is fixed to some prede- 

The major drawback of this method is that it would require termined value. Since on-board storage of the traffic profile 

floating-point division (as fractions of intervals would have is finite, either a traffic stream will be fed to the card via a 

to be taken into account). The resulting implementation PCI bus or a stored traffic stream will be cycled repeatedly, 

missed cells while the processor was busy doing the with a period of approximately half a second for 3 VCs. 

floating-point operation. 40 Configuration of the board for traffic capture produces a 

A second method involves no floating-point operations at traffic profile file, which is written to disk on a PC. Before 

all. In the algorithm, a *1' or a '0' is input to the accumulator capture can begin, a set of three ATM headers must be 

for the first column once every 0.00028 seconds (one downloaded to the board for the lookup table. These headers 

l%OC-3 cell interval time). Since the second column accu- are used for comparison with each header extracted from the 

mulator receives the sum of two numbers in the first, its rate 45 incoming SONET frame. A match between a received 

is half of the previous column (it receives an input every header and one from the look-up table results in a label being 

0.00056 seconds). Continuing this to the fifth column, its assigned in that cell period. Headers which do not match 

accumulator receives a new value once for every 32 cell those in the table are labeled as that of an idle/unassigned 

inputs (2 5 ), so its 'arrival interval* is actually 32 l%OC-3 cell. In this manner of header extraction, comparison and 

cell arrival times (0.00896 seconds). When the first cell 50 filtering, and labeling, the timing information from the 

arrives, its timestamp Ts„l is noted, and the end of the first incoming ATM stream can be recorded, 

interval Ts_end (Ts_l+CPU_clock-ticks_for_2 5 _cells) The card resides in a PCI slot (33 MHz, 5V) of a personal 

is calculated. The interval cell count is initialized to 1. When computer. Also required for operation of the card is a 

the next cell arrives, its timestamp is compared to the end of connection to a PC's RS-232 serial port. When configured 

interval value. If the timestamp is less than Ts_end, the cell 55 for traffic capture, the captured traffic profile will be written 

has arrived within the interval, and the cell count is incre- to hard disk at a rate of approximately 0.5 Gbyte/hour. The 

mented. If the timestamp is greater than Ts_end, the interval board's optical driver requires a multi-mode, 1300 nm 

is over. The interval counter is inserted into the H-meter wavelength, SC-style fiber optic connection, 

algorithm (this time at the fifth column) and the counter is Implementation for ATM over OC-3: An implementation 

then cleared. The next end of interval is calculated as 60 of the algorithm for where p lU -2 and p^-l is shown in FIG. 

Ts_end+CPU_clock-ticks_for_2 5 _cells. This method 2. The bin counts are given by n c ~2 c . The shift register 

counts the number of cells that arrive within a certain lime configuration has been used for the bins and the vectors. The 

period. It eliminates the work of the first four columns, and input is SONET information converted into a binary stream, 

involves no floating-point calculations in constructing the This information is stored and organized in an array of bins 

traffic stream 65 and vectors. 

Preparation of the Data: A device that prepares the infor- The data is processed as follows. Bin 1 receives two bits, 

mation for the algorithm is a network interface card capable adds them up, and sends the result to the vector Sj and to bin 
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2. Since bin 2 is not full yet, then two more bits are read into 
bin 1 and the result sent to s ± and to bin 2. This new result 
is stored in the first position of s 2 and of bin 2, while the 
previous values are shifted. Since bin 2 is now full, the two 
elements are added and the result send to s 2 and to bin 3. 5 

Traffic Characterization: Several calculations can be per- 
formed on the vectors s c to characterize the traffic, for 
example first order statistical measures like variance, 
correlation, and the coefficient of variation, or second order 
statistics like the Hurst parameter H and other fractal dimen- no 
sions. A methodology using variance-time curves is utilized 
as follows. The vector v of variances is calculated where 
each element v c contains the variance of the elements in the 
vector s c . The variances are scaled by dividing each element 
by the square of bin count. The information is represented by 15 
a plot of the logarithm of v versus the logarithm of the bin 
count. The traffic can then be characterized by parameter- 
ization of the variance-time curves. For example, the slope 
of the variance-time curve is related to the Hurst parameter. 

An example of a real-time plot of the coefficient of 20 
variation is shown in FIG. 10. The curves represent JPEG 
video traffic at different quality factors. FIG. 11 illustrates an 
application of the invention in traffic classification. The 
vertical axis is the log of the variance, and the horizontal axis 
is the log of the bin size index. The plots represent typical 25 
variance-time curves of transmissions of video, audio, and 
ftp (file transfer protocol) signals. A variance-time curve is 
a plot of the logarithm of the variance of the vectors agaiast 
the logarithm of two to the vector number. The plot for each 
stream is clearly distinct from the others. Since the plots can 30 
be constructed in real-time as the vectors are being 
calculated, this technique can be applied in real-time to 
characterize the traffic from measures of the variance-time 
curves. 

The process which generates the Hurst parameter from 35 
variance time curves is illustrated in FIG. 12. The Hurst 
parameter is calculated at different time intervals and a plot 
is constructed with the different values of H. The resolution 
of the graph depends on the lenlh of the time intervals. 

FIG. 13 shows an application of the algorithm for calcu- 40 
lation of the H parameter of the 'jump to hyperspace* scene 
of the StarWars movie. At the centre of the plot (left-hand 
side image) it is seen that the H parameter increases in value 
as the images changes. This is a motion JPEG encoded 
transmission of the video over an ATM network. 45 

FIG. 14 illustrates the application of the algorithm in 
classification of different traffic streams, A scene with the 
character R2D2 from the motion JPEG -coded StarWars 
movie over ATM is compared agains an audio transmission 
of the same movie. Similarly FIG. 15 shows that application 50 
of the algorithm to the observation of scenes of high activity 
from the motion JPEG-coded StarWars movie. 

The Hurst parameter (H parameter) provides information 
about the nature of the traffic additional to that which is 
currently available in the standards. It is a measure of the 55 
rate of change of burstiness with respect to changes in time 
scale. The contribution of the present algorithm for traffic 
characterization applications is the ability to calculated the 
H parameter in real-time. 

Traffic Classification: The characteristics of different traf- 60 
fic streams can be characterized by variance-time curves. A 
set of variance vectors v for different traffic streams can be 
used to train a classifier, for example a neural network. The 
variances can be calculated in real-time from the array of 
vectors s c and the classification can be perform at regular 65 
time intervals, for example every second. This information 
can be used for monitoring. 
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An example of traffic classification using variance-time 
curves is shown in FIG. 11. The signals represent FTP (top), 
video (center), and audio (bottom) over ATM. The signals 
have been characterized by the Hurst parameter H. 

The data in its original form, shown in FIG. 11, was 
presented to several network architectures in attempt to 
solve the classification problem. However, it was deter- 
mined that the strong linear trends in the time series made it 
impossible for the network to learn the subtle variations in 
the data. 

To transform the data into a suitable form for presentation 
to the network, the three time series of FIG. 11 were 
detrended. As the time series appear to be linear, first order 
detrending techniques were suitable. Two methods of 
detrending were considered; (i) a least-mean-square fit of a 
line to the data and (ii) detrending by finding the correlation 
of each series. Both methods removed the linear trends from 
the time series. 

The least -mean-squares method of detrending is well 
known. The slope and the intercept of a line that models the 
data were determined. Using the slope and intercept, the 
trend component of each element in the series was deter- 
mined and then removed from the original series, as shown 
in the equation below. When the data was prepared in this 
form, it was possible to design a network that could classify 
the three data types. 

SjS;- (a lope */;+ intercept) 

where: S rf »the detrended signal 
S ( =the original signal 
slope«the slope of the model of the line 
t^time corresponding to the sample point, i 
intercept=the y-intercept of the line that models the data 
curve 

It can be seen from FIG. 11 that audio data is highly 
correlated, video data is less but somewhat correlated and 
FTP data shows almost no degree of correlation. 

Noticing that the three data types could be distinguished 
by their degree of correlation, a simpler method of detrend- 
ing is provided in which firstly the slope and the intercept of 
each time series is calculated. Each time series is then 
detrended as follows 

This method finds the correlation of the signal by taking the 
absolute difference in successive signal amplitudes. This 
value is then raised by the intercept of the line that models 
the data curve. When the data is detrended in this manner, it 
is possible to design a network that could classify the three 
data types. Signals detrended by this method are shown in 
FIG. 16. 

Although data detrended by the method of least-mean- 
squares allowed a network to classify the data, the second 
method of detrending is less computationally intensive. The 
second method is equally successful in removing the linear 
trend information and it involved only addition and subtrac- 
tion operations whereas the method of lcast-mean-squares 
involved a multiplication operation. Reducing the complex- 
ity of preprocessing is targeted to classify the data in 
real-time. 

In order to solve this data classification problem, three 
neural network architectures can be attempted. First, a 
multi-layer feed-forward network may be considered. The 
weights are updated using a back propagation algorithm. 
However, it is not possible to determine a network of this 
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type that could learn to classify the input vectors. Even using We now employ a heuristic approximation: 
momentum, the network's sum-squared error becomes stuck 

in a local minimum and is never able to reach the error goal. max A a m a 

Next, a LVQ (Linear Vector Quantization) network can be *gi«-o) 

designed. The network consisted of 15 inputs, 3 tan- 5 

sigmoidal neurons in the hidden layer and 3 linear output wnere 
neurons. This network is able to successfully classify the 
three data types. 

Although it is not obvious by observing the input vectors m % = ">« M A*-nj-iiaj+ja 

of FIG. 16, the three classes of input vectors are linearly i-i.z.....i— J 
separable. This is determined by experimenting with a third 

network; a competitive network. Tlie competitive network and whefe w deno(es ^ , ^ QOt , , han x 

that solves this problem is shown m FIG. 17. The network - me floQr or ^n^tion ). In worGS) m a ^ 

consols of 15 inputs and 3 perceptions in the output layer. defined ^ follows . Divide ^ tjme in(erval(S)t) int0 a 

As perceptions are able to classtfy the three data types, it is number of smaller aon ^ yet i avpiag inter vals of length a. 

determined that the data must be linearly separable. This Count ^ mmbet of ceUs arrivin m each rf those smaU 

network architecture is favored over the LVQ design is * intervak aad ^ tne maximum of thoS6 numbers . result 

involves fewer weights and neurons. It can been seen FIG. ^ „ ^ exac( ^ similar> exce t that the 

18 that the competitive network found the three data clus- m overlapping . For example, if a-4, s-0 and 

terS- , , . . , ?n t— 16, the approximate calculation uses the intervals (0,4), 

Twenty percent of the input vectors are set aside for (sm/o i» and (13,16) whereas the exact calculation uses 

testing. Once training was completed, the network is tested ^ ^ fa (0> 4) > (i >5 ) > (2,6),(3,7) (12,15) and (13,16). 

on the remaining 600 input vectors. Afte. : 500 training In tne exact ^Wn 
epochs, the network is able to classify all 600 input vectors 
into three classes without error. 

Traffic Prediction: The array of vectors s c can be used to 25 *535>/*"* 
predict traffic. A set of vectors s^ and s,- can be used as a 
training set for a neural network predictor. Once the neural 

network is trained it can take the input from a vector s, , in we take the maximum value of A^ +a for all possible values 

real-time and predict elements of s. This information can be of x ( i e - x=s,s+l,s+2, . . . ,t-a). In the approximation m„ , 

used by different layers of protocols to handle congestion, 30 we take the maximum of only the values of A^ corre- 

call admission control, and in general to monitor a source. sponding to disjoint intervals (x,x+a) (i.e. x«s,s+a,s+ 
Performance Prediction: The following describes a 

2a,... ,t-a). 

method for estimating the maximum rate of constant bit rate ™c calculation of the values m„ a can be done efficiently 

traffic which can be added to a given traffic stream without m real time using a method similar to that described above 

overflowing a buffer of fixed size during a time window of 35 for th e calculation of the variance-time curve. We employ a 

fixed duration. Let p denote the rate at which cells are scries of m bins labelled 1,2, ,m. Each bin is made up 

removed from a buffer of size B and let A st denote the of two processing units which can store one integer each. A 

number of cells which arrive to the buffer in the time interval stream of bits representing the presence "0" or absence "1" 

(s,t). It is well known that the queue length (i.e. the number °f a cel1 (packet) in each transport location is fed into the 

of cells in the buffer) at time t is given by 40 firel bin - Each bin ^gins in a cleared state, with no integer 

stored in either processing unit. If a bin in a cleared state 

receives an integer, it stores the integer in the first of it's 

processing units. If a bin which is not in a cleared state (i.e. 

it has an integer stored in one of it's processing units) 

„,.«,» x n i • mi-, in 45 receives an integer, it will place the new integer in its second 

- 7„ iT "™ K f ~ S) - eXa T? ,\ m ,h , ' T Processing unitfcalculate the sum of the two integers in its 

pose that we know W„ for every point (s,t) in the plane The V ^ uni , and , he sum , 0 lhe nex( bfa ^ bin t 

queue length at time 13 is the maximum value of W„ along * {q * n 2> ^ * |q ^ 3 ^ ^ ^ of ^ 

the line segment AB of HG. 19. . process is that, at any time, an integer stored in a processing 

If the maximum queue length is required in the interval r . . * *i_ M c — .* u » c Z 

r rt \ - 11 1. r«, t 50 unit of bin c represents the sum of 2 consecutive bits fed 

0,t 3 ), we can s.mp ly take the maximum of W along a 1 the £ ^ cofTesponds l0 lhe number of 

line segments parallel to AB (for example a t,, t, etc.) and ^ in f ^ rf y> ^ 

then find the largest of those maximum values. However, . Y ,u- „, „ m r tUat . (a<>.,^u„a 

... . , 6 . 1 • i_ • *«» *u bin also passes this sum to a maximum niter (described 

this is equivalent to iust taking the maximum of W c . over the , . . r . , , . j , . tU L « ♦ e 

. \ . J . iT ,^„, § . .. below) and returns to a cleared state, erasing the contents of 

entire triangular region ABC. We can envision taking this . . 7 . 

• j-n: 11 . . jhp mw « . 55 both processing units. 

maximum in a different way, as illustrated FIG. 20 We nrst . , °. . . . . . n . „. . 

„ , , . r ... } l . r . i* . Associated with each bin is a maximum niter. We thus 

find the maximum of W„ along each of the line segments . au . . . 1 - ^ mnmt 

_ , r , L % i_ i -j have m maximum filters labelled 1,2, ... m in the same way 

paralle to CB (for example , the segments labeled t=s, . . . „ . f . , 

\ j .l c j .u i . r »u t as the bins. Each maximum filter consists of an input port, 

t=s+a 15 t«s+ou) and then find the largest of those values. , . , » , , ^ n„^u ~c 

tr 1 ' ... .i. a memory, a counter, an alarm and an output port. Each or 

If we find the maxunum in this way, we can express the . f , . . • . ^nf ■ f 

. c ii • ,i • cc j ■ # u * ■ 60 these four elements can store one integer. When an integer 

maxunum number of cells in the buffer dunng the time ^ ^ of a maj[mlum fiker from 6 ils 

mterval (s,t) by the following: corresponding bin, the filter compares the integer in the 

input port to the integer in memory. The filter then discards 

M *W = the smaller of the two integers, moves the larger of the two 

t u/ \ ( u , i I * 65 integers into its memory, increments the integer in its 

ma* { max = max < max VV, } = max ( max A SwXHX - pa. *» . 

v€(wM*euv) J tte(0.f-j^xcu/-o ^ oG(0.r-j) *x€(m-o> counter by 1 and compares the counter to the alarm. II the 

integers match, then the integer in memory is moved to the 
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output port, thus overwriting the integer in the output port, 
and the integer in memory is set to 0. 

Associated with each bin is a vector similar to the vectors 
s c described in the method for calculating variance-time 
curves. Each integer which appears at the output port of 
maximum filter c is passed to vector s c which records 
integers passed to it in sequence, dropping the least recent 
ones when full. If a is an integral power of 2, say a=2 c , then 
m„ a is the maximum of the most recent 



integers passed to maximum filter c. In practice 



30 



15 



1^1 



may be quite large when a is small and so we can not store 
all 



1^1 



elements at once. This is the reason for the maximum filters. 
The value of m w a can be obtained by finding the maximum 
of the most recent 



20 



18 



buffer does not exceed the buffer capacity B for the duration 
of the time window of length A. We can calculate this 
quantity at any time by performing a binary search for this 
value of ix, A binary search consists of evaluating the 
function M,_ Aj Xu) for two values of say pt L and n Rj and 
continually updating the values of [i L and fi R such that the 
size of the interval (ji L , p R ) is decreased by half at each 
iteration and always contains the value of ju for which 
M f _ A f/ 00 s B. The method of binary search is well known and 
can be found in the literature. 

Projection of Long Term Effective Bandwidth: Let /^t, 
A,B) denote the effective bandwidth described above: i.e. 
l*^p(t,A,B) represents the minimum value of pL for which 
M,_ A /(/*)=B where M,_ Ar '(u) denotes the approximation 
described above for the maximum number of cells in the 
buffer during the time window (t-A,t). Suppose that at time 
t we have been observing a traffic stream for some length of 
time to and wish to estimate the bandwidth ^ g ^\ y Q,B) 
required to prevent buffer overflow during some time win- 
dow of length Q where Q>w. We will assume that co^^S for 
some integer J (otherwise we can simply decrease co 
accordingly). Now define E^t^) to be a measure of the 
average value of M f _ A ,'(^) over the interval of length co 
when A»2*S. Specifically: 



25 



2 j-k 



integers in the vector s c where a=2 c providing that the alarm 
in maximum filter c is set to c7a=2~ c S. We assume that 6 is 
an integral power of 2, say 5«2 rf , so that the alarm in filter 
c must be set to 6/a=2 rf ~ c , This effectively breaks up the 
calculation of ni„ a by dividing the interval(s,t) into a num- 
ber of smaller intervals of length 6, taking the maximum 
over each of the smaller intervals and calculating m^ a as the 
maximum of those. 

In this manner we can easily keep track of the values of 
m,_ A ^ a for any value A of interest for a= 1,2,4, . . . where t 
represents the current time. We are interested in the maxi- 
mum number of cells in the buffer in the time interval (t-A, 
t) which is given by 

M f -nj(+i) = max \m°_ Aj -fta. 

For each value of c~l,2, . . . ,m we have a value of m,_ Af a 
where a=2 c . We subtract /*a«2 c // from this value for each of 
c-1,2, . . . ,m and find the maximum of the m resulting 
values. We estimate M t ^J^i) by fitting a quadratic function 
to a small number of points of the form (a,m,_ Af "-/"a) with 
values of a centred around the integral value of a where the 
maximum occurred. The operation referred to as "fitting a 
quadratic function" refers to finding the parameters of a 
quadratic curve which passes through or as near as possible 
to the points in question. Methods for fitting a quadratic 
function are available in the literature. 

Note that it is only the last step which depends on the 
value of ft, the service rate for the buffer. We can thus repeat 
the last step for a number of different values of n without 
having to repeat the measurements involved in filling the 
vectors s c . 

The effective bandwidth is defined as the minimum value 
of p for which the maximum number of cells M,_ A ^) in the 



30 In words, we divide the interval of length (a^b into 
2C 7 -*) intervals of length A =2*6 and estimate the maximum 
queue length over each of these intervals. The average of 
these values is E^t^w). In practice we have seen that the 
variation of E^t^w) with k can be approximated by the 

35 following relation: 

Thus if we fix t and fi and plot the logarithm of E^t^u) as a 
40 function of k, we get something that looks close to a straight 
line. Our method for projecting the effective bandwidth over 
the long term Q consists of estimating the coefficients a(t*w) 
and b(t^w) by fitting the points (k,log 2 [E*(t/0]| kts0 il» ... J to 
a straight line and then extrapolating a value for M,_ n ^u) by 
45 extending the line out further. Fitting a straight line consists 
of finding the line that passes closest to the points and 
methods for doing this are available in the literature. We thus 
estimate M,_ a Xu) according to: 

50 

Efficient Calculation of E^t^): We can calculate the 
values of E^lji) efficiently in real-time as follows. Let m ; c 
denote the j th integer passed from maximum filter c to its 
corresponding vector s c . If we divide the time interval 

55 (j^0 + l)^) mt0 intervals of length 2 C then mf is the maxi- 
mum of the numbers of cells arriving in each of these 
intervals. With each value of c we associate a column of 
processors. Each processor has one register which can store 
an integer, one input and one output. When the register is 

60 empty, the processor waits for an integer to appear at its 
input and stores it in the register upon arrival. When the 
register is full, the processor waits for the arrival of a second 
integer at the input. Upon arrival of the second integer, the 
processor outputs the maximum of the two successive inputs 

65 and clears the register. These processors are similar to the 
bins referred to above except that they output the maximum 
of the two inputs instead of the sum. In each column of 
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processors, the output of one processor is fed into the input transmission, such that some packet transport locations 

of the next processor in the column. Thus, if successive in a train contain packets and some packet transport 

values of m? are passed to the first processor in the column locations are empty and such that, when there is more 

c of processors, then, at any time, the k th processor in than one different source, the train contains packets 

column c contains the maximum of the most recent 2 k values 5 from the different sources in a sequential arrangement 

of m y c passed to the column of processors. We can think of ^ determined by the network; 

arranging these processors in a rectangular array with the monilormg a train of packet transport locations to deter- 

processors in each column arranged vertically from top to mine which ket Uusspori i ocat ions are empty and 

bottom as in FIG. 21. We denote the k th processor in WQich COQtam a ket; 

column c by P^. Also, we refer to the collection of the k tb 10 . 

processors from all the columns (i.e. P^, . . . .P^J as the generating a senes of data elements each corresponding to 

k th row of processors. At any time t, the k th row of a "fFf™?. 006 ° f , th * ^ tranSp ° rt loCa ' IOnS and 

processors contains the values of m. A," for "=1.2,4 each identifying whether the respective packet trans- 

n ■ j- II u r n a a *u i port location is empty or whether the respective packet 

Periodically, each row of processors is polled and the values ^ y J . r F 

c k a j * i i * w k 1/ \ e ~ „ transport location contains a packet; 

of m f 2 k 6r are used to calculate M t _ 2 k bt (ji) for various 15 * r ' 

values of //. These values are in turn fed into row of and carrying out simultaneously at a plurality of different 

processors which output the sum of two successive inputs so time ^ales statistical analyses on the series of data 

that the i th processor in the k th row of this summing array elements to determine the characteristics of the trans- 

always contains the sum of the 2* most recent values of missions. 

M^VC")- T^sc provide the value of the sum in the 20 2. The method according to claim 1 wherein the statistical 

expression for E^t^) above if we take i-J-k. In this manner analyses are carried out in real time, 

we can, in real-time, efficiently keep track of all the quan- 3 - The method according to claim 1 wherein the data 

tities necessary to make a long-term effective bandwidth elements are stored and the statistical analyses are carried 

projection. out subsequent to completion of the transmissions. 

Call Admission Control: The estimation of effective band- 25 The method according to claim 1 wherein the statistical 

width can be used in two ways. In the first way, we have a analyses are carried out by providing for each different time 

number of existing sources sending packets into a network. scale a respective one of a plurality of registers and entering 

We can estimate the effective bandwidth of the traffic from information from the data elements into each register 

all of those sources at some point in the network. If another sequentially. 

source requests to begin sending additional traffic through 30 5- The method according to claim 4 wherein the data 

the same point in the network the network will reject the elements comprise data bits defining "0" when the respec- 

request if the sum of the effective bandwidth calculated and tivc packet transport location is empty and "1" when the 

a peak bandwidth proposed in the request exceeds the respective packet transport location contains a packet and 

capacity of that section of the network. Alternatively, the wherein the Information for each register is obtained by 

network could decrease the effective bandwidth of the 35 adding the bits from a next adjacent previous register, 

existing traffic by allocating more buffer space. 6 - The method according to claim 5 wherein the analyses 

The second application is slightly different. Suppose that include for each register calculating the sample variance of 

a given traffic stream (for example a video-on-demand a set of successive observations of the register contents and 

service) is fed from one location on the network to a from the variances estimating a value of the Hurst parameter 

multiplicity of other locations on the network. If another 40 H, which is the slope of a line which approximates the 

location on the network which is not receiving the trans- behavior of a plot of the logarithm of the variances of the 

mission requests reception of the transmission, a connection registers versus the values of the sequential indices of the 

will have to be set up from the location providing the service registers. 

to the location requesting it In this call setup, the server can 7. The method according to claim 1 wherein the packets 

specify the equivalent bandwidth to nodes on the network 45 in the train are provided by a plurality of different sources 

which must carry the traffic from the server to the new and wherein the data elements arc arranged to identify a 

recipient. Even if these nodes do not use a protocol which packet transport location as containing a packet only when 

involves the effective bandwidth, the effective bandwidth th e packet is identified from the id data as provided by a 

could simply be sent to them in place of the peak bandwidth selected one of the sources such that the characteristics 

and those nodes will set up the connection as if this was the 50 determined relate to only the transmissions from the selected 

actual peak bandwidth. Alternatively, the server could send source. 

both a buffer requirement and a corresponding equivalent 8. The method according to claim 1 wherein the packets 

bandwidth requirement and the other nodes could react in the train are provided by a plurality of different sources 

accordingly by making use of a protocol which employs and wherein the data elements are arranged: 

information about effective bandwidth. 55 to identify a packet transport location as empty when no 

What is claimed is: packet from any of the sources is contained: 

1. A method for deriving information related to charac- to identify when a packet transport location contains a 

teristics of transmissions in a packet network comprising: packet from the id data as provided by each one of the 

providing a packet network for carrying a plurality of sources which source provided the packet. 

transmissions from at least one source in which the 60 9. The method according to claim 1 wherein the charac- 

transmissions from the or each source are divided into teristics determined from the data elements are used to 

a plurality of sequential packets each packet having provide a calculation of an effective bandwidth of the 

address data defining an intended address, information transmissions. 

data defining information to be transmitted and id data 10. The method according to claim 1 for determining 

defining a source identity; 65 whether a source additional to a plurality of existing sources 

the network defining a train of sequential packet transport of packet transmissions, each source having a predetermined 

locations into which packets are loaded for peak rate of packet transmission, can be connected to a 
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packet network, a transmission medium of the network 14. The method according to claim 13 wherein the data 

having a predetermined maximum allowable peak rate of elements comprise bits which define "0" when the respective 

transmissions, including the steps of: packet transport location is empty and "1" when the respec- 

carrying out in real time the statistical analyses on the tive packet transport location contains a packet and wherein 
series of data elements to determine an effective band- 5 the information for each register is obtained by adding the 

width of the transmissions from the existing sources; bits from a next adjacent previous register, 
and calculating whether the additional source can be 15. The method according to claim 14 wherein the analy- 

connected by comparing the effective bandwidth, the sis includes for each register calculating the sample variance 

predetermined peak rate of packet transmission of the 0 f a set 0 f successive observations of the register contents 
additional source and the predetermined maximum 10 am i from tne variances estimating a value of the Hurst 

allowable peak rate of transmissions of the transmis- parameter H, which is the slope of a line which approxi- 

sion medium of the network. _ ma tes the behavior of a plot of the logarithm of the variances 

11 The method according to claim 1 for determining of the istere ymu& the values of me ntial indices of 

whether a source additional to a plurality of existing sources ^ re aj s t e rs 

of packet transmissions, each source having a predetermined *^ method aca) Q whefein ^ 

peak rate of packet transmission, can be connected to a . . t . . . . , to . - . 

second packet network, a transmission medium of the sec m < he < ram ar » P r ° vided bv a P luraht > ' of d f e ™ 1 so »"« 8 

ond packet network having a predetermined maximum and wherein the information .s arranged to identify a packet 

allowable peak rate of transmissions, including the steps of: transport location as containing a packet only when the 

connecting the additional source to the packet network; packet is identified from the id data as provided by a selected 
generating said series of data elements from transmissions 20 0I f of the *° ur 1 oes such ! hat the characteristics determined 

from said additional source to said packet network; rclate «° ° nlv «•* transmissions from the selected source, 

carrying out in real time the statistical analyses on the . 71,6 method ac °° rdm 6 *> c aim 12 where.n the packets 

series of data elements to determine an effective band- m ' he ' ram ar f P r ™ ded bv ». rahty of different sources 

width of the transmissions from said additional source; and wherem tne "Nation ■ ranged: 
and calculating whether the additional source can be 25 t0 identify a packet transport location as empty when no 
connected to the second packet network by using the P acket from an y of the •«» 15 contained; 

effective bandwidth of the transmissions from said to identify when a packet transport location contains a 
additional source. packet from the id data as provided by each one of the 

12. A method for deriving information related to the sources which source provided the packet, 

characteristics of transmissions in a packet network com- l*- The method according to claim 12 wherein the char- 



prising: 



acteristics determined from the information are used to 



providing a packet network for carrying a plurality of P rovide a calculation of an effective bandwidth of the 

transmissions from at least one source in which the transmissions, 

transmissions from the or each source are divided into « 19 ^ melnod according to claim 12 for determining 

a plurality of sequential packets each packet having whether a source additional to a plurality of existing sources 

address data defining an intended address, information of P acket transmissions, each source having a predetermined 

data defining information to be transmitted and id data P cak ratc of P ackct transmission, can be connected to a 

defining a source identity packet network, a transmission medium of the network 

the network defining a train of sequential packet transport 40 havmg a predetermined maximum allowable peak rate of 

locations into which packets are loaded for transmissions, including the steps of: 

transmission, such that some packet transport locations carrying out in real time the statistical analysis on the 

in a train contain packets and some packet transport information to determine an effective bandwidth of the 

locations are empty and such that, when there is more transmissions from the existing sources; and 

than one different source, the train contains packets 45 and calculating whether the additional source can be 

from the different sources in a sequential arrangement connected by comparing the effective bandwidth, the 

as determined by the network; predetermined peak rate of packet transmission of the 

monitoring a train of packet transport locations to deter- additional source and the predetermined maximum 

mine which packet transport locations are empty and allowable peak rate of transmissions of the transmis- 

which contain a packet; 50 sion medium of the network, 

generating information defining which packet transport 20 ^ method according to claim 12 for determining 

locations contain a packct and the empty locations whether a source additional to a plurality of existing sources 

therebetween- °^ P ac ^ et transmissions, each source having a predetermined 

and carrying oui simultaneously and in real time a series P eak " te ° f P acket transmission, can be connected to a 

of statistical analyses on the information at a plurality 55 sec 1 ond P acket netw ° rk ' a Emission medium of the sec- 

of different time scales to determine the characteristics ond P a f et network havu, g a . predetermined maximum 

of the transmissions allowable peak rate of transmissions, including the steps of; 

13. The method according to claim 12 wherein the infer- connecting the additional source to the packet network; 

mation comprises a series of data elements each correspond- generating said information from transmissions from said 

ing to a respective one of the packet transport locations and 60 additional source to said packet network; 

each identifying whether the respective packet transport carrying out in real time the statistical analysis on the 

location is empty or whether the respective packet transport information to determine an effective bandwidth of the 

location contains a packet and wherein the statistical analy- transmissions from said additional source; 

ses are carried out by providing for each different time scale and calculating whether the additional source can be 

a respective one of a plurality of registers and entering 65 connected to. the second packet network by using the 

information from the data elements into each register effective bandwidth of the transmissions from said 

sequentially. additional source. 
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21. A method of generating packet transmissions for 
simulating a source having required transmission character- 
istics for transmitting on a packet network comprising: 

providing a packet network for carrying a plurality of 
transmissions from at least one source in which the 5 
transmissions from the or each source are divided into 
a plurality of sequential packets each packet having 
address data defining an intended address, information 
data defining information to be transmitted and id data 
defining a source identity; 10 

the network defining trains of sequential packet transport 
locations into which packets are loaded for 
transmission, such that some packet transport locations 
in a train contain packets and some packet transport 
locations are empty and such that, when there is more 15 
than one different source, the train contains packets 
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from the different sources in a sequential arrangement 
as determined by the network; 

from a statistical analysis of previous actual transmissions 
on the network, generating for the simulated source the 
required characteristic; 

generating from the required characteristic a series of data 
elements each corresponding to a respective one of the 
packet transport locations and each specifying whether, 
in a simulated train of packets, a respective packet 
transport location is empty or whether the respective 
packet transport location contains a packet; 

and creating from the data elements the simulated packet 
train. 

* * * * * 
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